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Input Set : N:\Crf3\RULE60\09964238.txt 
Output Set: N:\CRF3\11012001\I964238.raw 

APPLICANT: Vlaams Interuniversitair Instituut voor Biotechnology 
TITLE OF INVENTION: SMAD- INTERACTING POLYPEPTIDES AND THEIR USE 
FILE REFERENCE: 2676-4232US 
CURRENT APPLICATION NUMBER: 09/964,238 
CURRENT FILING DATE: 2001-09-26 
PRIOR APPLICATION NUMBER:. 09/449,285 

1999-11-24 
NUMBER: PCT/EP98/03193 

1998-05-28 
NUMBER: 97201645.5 
1997-06-02 



PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
PRIOR APPLICATION 
PRIOR FILING DATE 
NUMBER OF SEQ ID NOS : 27 
SOFTWARE: Patentln version 
SEQ ID NO: 1 
LENGTH: 3006 
TYPE : DNA 

ORGANISM: Mus musculus 
SEQUENCE : 1 
gcagcactca gcaccaaatg ctaacccaag 
agtgtggcaa ggccttcaag tacaagcacc 




3.0 



gagcaggtaa ccgcaagttc aagtgcacgg 
acctgaaaga acacctgaga attcacagtg 



gtgaaaaacc ttacgaatgc ccaaactgca agaaacgctt ctctcattct gggtcctaca 
gttcacatat cagcagcaag aaatgtattg gtttaatatc agtaaatggc cgaatgagaa 
acaatatcaa gacgggttct tcccctaatt ctgtttcttc ttctcctact aactcagcca 
ttactcagtt aaggaacaag ttggaaaatg gaaaaccact tagcatgtct gagcagacag 
gcttacttaa gattaaaaca gaaccactag acttcaatga ctataaagtt cttatggcaa 
cacatgggtt tagtggcagc agtcccttta tgaacggtgg gcttggagcc accagccctt 
taggtgtaca cccatctgct cagagtccaa tgcagcactt aggtgtaggg atggaagccc 
ctttacttgg atttcccact atgaatagta acttgagtga ggtacaaaag gttctacaga 
ttgtggacaa tacggtttct aggcaaaaga tggactgcaa gacggaagac atttcaaagt 

gatccatgtt ctcagccaga agaacaaggg gtaacttctc 
cttccagtag tgagtcataa cggtgccact aaaagtatta 



tgaaaggtta tcacatgaag 
ccaatattcc ccctgtcggt 



55 

57 ttgactatac cttagagaaa gtcaatgaag ccaaagcttg cctccagagc ttgaccaccg 

59 actcaaggag acagatcagt aacataaaga aagagaagtt gcgtactttg atagatttgg 

61 tcactgatga taaaatgatt gagaaccaca gcatatccac tccattttca tgccagttct 

6 3 gtaaagaaag cttcccgggc cctattcccc tgcatcagca tgaacgatac ctgtgtaaga 

65 tgaatgaaga gatcaaggca gtcctgcaac ctcatgaaaa catagtcccc aacaaagctg 

67 gagtttttgt tgataataaa gccctcctct tgtcatctgt actttccgag aaaggactga 

69 caagccccat caacccatac aaggaccaca tgtctgtact gaaagcatac tatgctatga 

71 acatggagcc caactctgat gaactgctga aaatctccat tgctgtgggc cttcctcagg 

aatttgtgaa ggaatggttt gagcaaagaa 

catcactgga aaggacctcc aagccgttag 

ctttgttacc caggtctcct gtaaaaccta 

aactccacaa cagtgttacg agttgtgatc 



aagtctacca gtattcgaat tccaggtcac 
ctcccaacag taaccccacc acaaaagact 
tggactccat cacatcgcca tctatagcag 
ctcctctcag gctaacaaaa tcttcccatt 



tcaccaatat taaagcagtt gataaactgg accactcgag gagtaatact ccttctcctt 



taaatctttc ctccacatct tctaaaaact 
tctcttccga ggagctgcag gctgagccgt 



cccacagtag ctcgtacact ccaaatagct 
tggacctgtc attaccaaaa caaatgagag 



87 aacccaaagg tattatagcc acaaagaaca aaacaaaagc tactagcata aacttagacc 
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RAW SEQUENCE LISTING DATE: 11/01/2001 

PATENT APPLICATION: US/09/964 , 23 8 TIME: 13:41:55 

Input Set : N:\Crf3\RULE60\09964238.txt 
Output Set: N:\CRF3\11012001\I964238.raw 

89 acaacagtgt ttcttcatcg tctgagaatt cagatgagcc tctgaatttg acttttatca 1800 

91 agaaagagtt ttcaaattct aataacctgg acaataaaag caacaaccct gtgttcggca 1860 

93 tgaacccatt tagtgccaag cctttataca cccctcttcc accacagagc gcatttcccc 1920 

95 ctgccacttt catgccacca gtccagacca gcatccccgg gctacgacca tacccaggac 1980 

97 tggatcagat gagcttccta ccgcatatgg cctataccta cccaacggga gcagctacct 2040 

99 ttgctgatat gcagcaaagg aggaaatacc agaggaaaca aggatttcag ggagacttgc 2100 

101 tggatggagc acaagactac atgtcaggcc tagatgacat gacagactcc gattcctgtc 2160 

103 tgtctcgaaa gaagataaag aagacagaaa gtggcatgta tgcatgtgac ttatgtgaca 2220 

105 agacattcca gaaaagcagt tcccttctgc gacataaata cgaacacaca ggaaagagac 2280 

107 cacaccagtg tcagatttgt aagaaagcgt tcaaacacaa acaccacctt atcgagcact 234 0 

109 cgaggctgca ctcgggcgag aagccctatc agtgtgacaa atgtggcaag cgcttctcac 24 00 

111 actcgggctc ctactcgcag cacatgaatc acaggtactc ctactgcaag cgggaggcgg 2460 

113 aggagcggga agcagccgag cgcgaggcgc gagagaaagg gcacttggga cccaccgagc 2520 

115 tgctgatgaa ccgggcttac ctgcagagca tcacccctca ggggtactct gactcggagg 2580 

117 agagggagag catgccgagg gatggcgaga gcgagaagga gcacgagaag gagggcgagg 2640 

119 agggttatgg gaagctgcgg agaagggacg gcgacgagga ggaagaggag gaagaggaag 2700 

121 aaagtgaaaa taaaagtatg gatacggatc ccgaaacgat acgggatgag gaagagactg 2760 

123 gggatcactc gatggacgac agttcagagg atgggaaaat ggaaaccaaa tcagaccacg 2820 

125 aggaagacaa tatggaagat ggcatgggat aaactactgc attttaagct tcctattttt 2880 

127 tttttccagt agtattgtta cctgcttgaa aacactgctg tgttaagctg ttcatgcacg 2940 

129 tgcctgacgc ttccaggaag ctgtagagag ggacaaaaag gggcacttca gccaagtctg 3000 

131 agttag 3006 

134 <210> SEQ ID NO: 2 

135 <211> LENGTH: 944 

136 <212> TYPE: PRT 

137 <213> ORGANISM: Mus musculus 
139 <400> SEQUENCE: 2 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/964,23 8 



DATE: 11/01/2001 
TIME: 13:41:55 



Input Set : N:\Crf3\RULE60\09964238.txt 
Output Set: N:\CRF3\11012001\I964238.raw 



174 Leu Gly Phe Pro Thr Met Asn Ser Asn Leu Ser Glu Val Gin Lys Val 

175 180 185 190 

177 Leu Gin lie Val Asp Asn Thr Val Ser Arg Gin Lys Met Asp Cys Lys 

178 195 200 205 

180 Thr Glu Asp lie Ser Lys Leu Lys Gly Tyr His Met Lys Asp Pro Cys 

181 210 215 220 

183 Ser Gin Pro Glu Glu Gin Gly Val Thr Ser Pro Asn lie Pro Pro Val 

184 225 230 235 240 

186 Gly Leu Pro Val Val Ser. His Asn Gly Ala Thr Lys Ser lie lie Asp 

187 245 250 255 

189 Tyr Thr Leu Glu Lys Val Asn Glu Ala Lys Ala Cys Leu Gin Ser Leu 

190 260 265 270 

192 Thr Thr Asp Ser Arg Arg Gin lie Ser Asn lie Lys Lys Glu Lys Leu 

193 275 280 285 

195 Arg Thr Leu lie Asp Leu Val Thr Asp Asp Lys Met lie Glu Asn His 

196 290 295 300 

198 Ser lie Ser Thr Pro Phe Ser Cys Gin Phe Cys Lys Glu Ser Phe Pro 

199 305 310 315 320 

201 Gly Pro lie Pro Leu His Gin His Glu Arg Tyr Leu Cys Lys Met Asn 

202 325 330 335 

204 Glu Glu lie Lys Ala Val Leu Gin Pro His Glu Asn lie Val Pro Asn 

205 340 345 350 

207 Lys Ala Gly Val Phe Val Asp Asn Lys Ala Leu Leu Leu Ser Ser Val 

208 355 360 365 

210 Leu Ser Glu Lys Gly Leu Thr Ser Pro lie Asn Pro Tyr Lys Asp His 

211 370 375 380 

213 Met Ser Val lieu Lys Ala Tyr Tyr Ala Met Asn Met Glu Pro Asn Ser 

214 385 390 , 395 400 

216 Asp Glu Leu Leu Lys lie Ser lie Ala Val Gly Leu Pro Gin Glu Phe 

217 405 410 415 

219 Val Lys Glu Trp Phe Glu Gin Arg Lys Val Tyr Gin Tyr Ser Asn Ser 

220 420 425 430 

222 Arg Ser Pro Ser Leu Glu Arg Thr Ser Lys Pro Leu Ala Pro Asn Ser 

223 435 440 445 

225 Asn Pro Thr Thr Lys Asp Ser Leu Leu Pro Arg Ser Pro Val Lys Pro 

226 450 455 460 

228 Met Asp Ser lie Thr Ser Pro Ser lie Ala Glu Leu His Asn Ser Val 

229 465 470 475 480 

231 Thr Ser Cys Asp Pro Pro Leu Arg Leu Thr Lys Ser Ser His Phe Thr 

232 485 490 495 

234 Asn lie Lys Ala Val Asp Lys Leu Asp His Ser Arg Ser Asn Thr Pro 

235 500 505 510 

237 Ser Pro Leu Asn Leu Ser Ser Thr Ser Ser Lys Asn Ser His Ser Ser 

238 515 520 525 

240 Ser Tyr Thr Pro Asn Ser Phe Ser Ser Glu Glu Leu Gin Ala Glu Pro 

241 530 535 540 

243 Leu Asp Leu Ser Leu Pro Lys Gin Met Arg Glu Pro Lys Gly lie lie 

244 545 550 555 560 
246 Ala Thr Lys Asn Lys Thr Lys Ala Thr Ser lie Asn Leu Asp His Asn 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/964 , 238 



DATE: 11/01/2001 
TIME: 13:41:55 



Input Set : N:\Crf3\RULE60\09964238.txt 
Output Set: N:\CRF3\11012001\I964238.raw 



247 



565 



570 



575 



249 Ser Val Ser Ser Ser Ser Glu Asn Ser Asp Glu Pro Leu Asn Leu Thr 

250 580 585 590 

252 Phe lie Lys Lys Glu Phe Ser Asn Ser Asn Asn Leu Asp Asn Lys Ser 

253 595 600 605 

2 55 Asn Asn Pro Val Phe Gly Met Asn Pro Phe Ser Ala Lys Pro Leu Tyr 
256 610 615 620 

258 Thr Pro Leu Pro Pro Gin Ser Ala Phe Pro Pro Ala Thr Phe Met Pro 

259 625 630 635 640 

261 Pro Val Gin Thr Ser lie Pro Gly Leu Arg Pro Tyr Pro Gly Leu Asp 

262 645 650 , 655 

264 Gin Met Ser Phe Leu Pro His Met Ala Tyr Thr Tyr Pro Thr Gly Ala 

265 660 665 670 

267 Ala Thr Phe Ala Asp Met Gin Gin Arg Arg Lys Tyr Gin Arg Lys Gin 

268 675 680 685 

270 Gly Phe Gin Gly Asp Leu Leu Asp Gly Ala Gin Asp Tyr Met Ser Gly 

271 690 695 700 

27 3 Leu Asp Asp Met Thr Asp Ser Asp Ser Cys Leu Ser Arg Lys Lys lie 
274 705 710 715 720 

276 Lys Lys Thr Glu Ser Gly Met Tyr Ala Cys Asp Leu Cys Asp Lys Thr 

277 725 730 735 

279 Phe Gin Lys Ser Ser Ser Leu Leu Arg His Lys Tyr Glu His Thr Gly 

280 740 745 750 

282 Lys Arg Pro His Gin Cys Gin lie Cys Lys Lys Ala Phe Lys His Lys 

283 755 760 765 

285 His His Leu lie Glu His Ser Arg Leu His Ser Gly Glu Lys Pro Tyr 

286 770 775 780 

288 Gin Cys Asp Lys Cys Gly Lys Arg Phe Ser His Ser Gly Ser Tyr Ser 

289 785 790 795 800 

291 Gin His Met Asn His Arg Tyr Ser Tyr Cys Lys Arg Glu Ala Glu Glu 

292 805 810 815 

294 Arg Glu Ala Ala Glu Arg Glu Ala Arg Glu Lys Gly His Leu Gly Pro 

295 820 825 830 

297 Thr Glu Leu Leu Met Asn Arg Ala Tyr Leu Gin Ser lie Thr Pro Gin 

298 835 840 845 

300 Gly Tyr Ser Asp Ser Glu Glu Arg Glu Ser Met Pro Arg Asp Gly Glu 

301 850 855 860 

303 Ser Glu Lys Glu His Glu Lys Glu Gly Glu Glu Gly Tyr Gly Lys Leu 

304 865 870 875 880 

306 Arg Arg Arg Asp Gly Asp Glu Glu Glu Glu Glu Glu Glu Glu Glu Ser 

307 885 890 895 

309 Glu Asn Lys Ser Met Asp Thr Asp Pro Glu Thr lie Arg Asp Glu Glu 

310 900 905 910 

312 Glu Thr Gly Asp His Ser Met Asp Asp Ser Ser Glu Asp Gly Lys Met 

313 915 920 925 

315 Glu Thr Lys Ser Asp His Glu Glu Asp Asn Met Glu Asp Gly Met Gly 

316 930 935 940 

318 <210> SEQ ID NO: 3 

319 <211> LENGTH: 2959 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/964,238 



DATE: 11/01/2001 
TIME: 13:41:55 



Input Set : N:\Crf3\RULE60\09964238.txt 
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320 <212> TYPE: DNA 

321 <213> ORGANISM: Mus musculus 

323 <400> SEQUENCE: 3 

324 ctggctaggc gtcgcggact ccggagatgg aggaaaagga gcagctgcgg cggcagatac 
326 gcctcctgca gggtctaatt gatgactata aaacactcca cggcaatggc cctgccctgg 
32 8 gcaactcatc agctactcgg tggcagccac ccgtgttccc gggtggcagg acctttggcg 
330 cccgctactc ccgtccaagt cggaggggct tctcctcaca ccatggccct tcgtggcgca 
332 agaaatactc ccttgtgaat cagcctgtgg aatcttctga cccagccagc gatcctgctt 
334 ttcagacatc cctcaggtct gaggatagcc agcatcctga accccagcag tatgtactgg 
336 agagacaggt ccagctcagt ccagatcaga atatggttat taagatcaag ccaccatcaa 
338 agtcaggtgc catcaatgct tcaggggtcc agcgggggtc cttggaaggc tgtgatgacc 
340 cctcttggag tggccaaaga ccccaaggaa gtgaggttga ggtccctggt ggacaactgc 
342 agcctgcaag gccaggaaga accaaggtgg gttacagtgt ggacgacccc ctcttggtct 
344 gccagaagga gcctggcaag cctcgggtag tgaagtctgt gggcagggtg agtgacagct 
34 6 ctcccgagca tcggcggaca gtcagtgaaa atgaagtggc cctcagggta cacttcccat 
34 8 ctgtcctgcc ccatcacact gctgtggctc tgggcaggaa ggtaggccct cattctacca 
350 gctattctga acagttcatt ggagaccaaa gagcaaacac tggccactca gaccagccag 
352 cttccttggg gccagtggtg gcttcagtca gaccagcaac agccaggcag gtcagggagg 
354 cctcactgct cgtgtcctgt cgaaccagca agtttcggaa aaacaactac aaatgggtag 
356 ctgcctcaga aaagagccca cgggtcgctc ggagagccct cagtcccaga acaactctgg 
358 agagcgggaa caaggccact ttgggtacag ttggaaagac agagaagcca cagcctaaag* 
360 ttgacccaga ggtgaggccg gagaaactgg ccacaccatc caagcctggc ctctctccca 
362 gcaagtacaa gtggaaggct tccagcccgt ctgcttcctc ctcttcctct ttccgttggc 
364 agtctgaggc tggcagcaag gaccatactt ctcagctctc cccagtccca tctaggccca 
366 catcagggga cagaccagca gggggaccca gcagcttgaa gcccctcttt ggagagtcac 
368 agctctcagc ttacaaagtg aagagccgga ccaagattat ccggaggcgg ggcaatacca 
370 gcattcctgg ggacaagaag aacagcccta caactgccac caccagcaaa aaccatctta 
372 cccagcgacg gagacaggcc ctccggggga agaatagccc ggttctaagg aagactcccc 
374 acaagggtct gatgcaggtc aacaggcacc ggctctgctg cctgccgtcc agccggaccc 
376 acctctccac caaggaagct tccagtgtgc acatggggat tccaccctcc aataaggtga 
378 tcaagacccg ctaccgcatt gttaagaaga ccccaagctc ttcctttggt gctccatcct 
380 tcccctcatc tctaccctcc tggcgggccc ggcgcatccc attatccagg tccctagtgc 
382 taaaccgcct tcgtccagca atcactgggg gagggaaagc cccacctggt acccctcgat 
384 ggcgcaacaa aggctaccgc tgcattggag gggttctgta caaggtgtct gccaacaagc 
386 tctccaaaac ttctagcagg cccagtgatg gcaacaggac cctcctccgc acaggacgcc 
388 tggaccctgc taccacctgc agtcgttcct tggccagccg ggccatccag cggagcctgg 
390 ctatcatccg gcaggcgaag cagaagaaag agaagaagag agagtactgc atgtactaca 
392 accgctttgg caggtgtaac cgtggcgaat gctgccccta catccatgac cctgagaagg 
394 tggccgtgtg caccagattt gtccgaggca catgcaagaa gacagatggg tcctgccctt 
396 tctctcacca tgtgtccaag gaaaagatgc ctgtgtgctc ctactttctg aaggggatct 
398 gcagcaacag caactgcccc tacagccatg tgtacgtgtc ccgcaaggct gaagtctgca 
400 gtgacttcct caaaggctac tgcccattgg gtgcaaagtg caagaagaag cacacgctgc 
402 tgtgtcctga ctttgcccgc aggggtattt gtccccgtgg ctcccagtgc cagctgctcc 
404 atcgtaacca gaagcgacat ggccggcgga cagctgcacc tcctatccct gggcccagtg 
406 atggagcccc cagaagcaag gcctcagctg gccacgtact caggaagcct actactactc 
408 agcgctctgt cagacagatg tccagtggtc tggcttccgg agctgaggcc ccagcctccc 
410 cacctccctc cccaagggta ttagcctcca cctctaccct gtcttcaaag gccaccgctg 
412 cctcctctcc ttccccctct ccctctacta gctccccagc cccttccttg gagcaggaag 
414 aagctgtctc tgggacaggc tcaggaacag gctccagtgg cctctgcaag ctgccatcct 
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VERIFICATION SUMMARY DATE : 11/01/2001 

PATENT APPLICATION : US/09/964,238 TIME: 13:41:56 

Input Set : N:\Crf3\RULE60\09964238.txt 
Output Set: N:\CRF3\11012001\I964238.raw 

L:652 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5 

L:732 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 7 

L:736 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 7 

L:738 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 7 

L:804 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 9 

L:886 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 11 

L:888 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:11 

L:892 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 11 

L:923 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:12 

L:925 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 12 

L:950 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13 

L:952 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13 

L:956 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13 

L:958 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:13 

L:985 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 14 
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